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Abstract. We discuss the relation between the Wasserstein distance of order 1 between 
probability distributions on a metric space, arising in the study of Monge-Kantorovich 
transport problem, and the spectral distance of noncommutative geometry. Starting from 
a remark of Rieffel on compact manifolds, we first show that on any - i.e. non-necessary 
compact - complete Riemannian spin manifolds, the two distances coincide. Then, on convex 
manifolds in the sense of Nash embedding, we provide some natural upper and lower bounds 
to the distance between any two probability distributions. Specializing to the Euclidean 
space K", we explicitly compute the distance for a particular class of distributions genera- 
lizing Gaussian wave packet. Finally we explore the analogy between the spectral and the 
Wasserstein distances in the noncommutative case, focusing on the standard model and the 
Moyal plane. In particular we point out that in the two-sheet space of the standard model, 
an optimal-transport interpretation of the metric requires a cost function that does not va- 
nish on the diagonal. The latest is similar to the cost function occurring in the relativistic 
heat equation. 
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1 Introduction 

The idea at the core of Noncommutative Geometry [11] is the observation that, in many in- 
teresting cases, the description of set of points is inadequate. Think for example 
of quantum mechanics, where position and momentum are replaced by non-commuting opera- 
tors: as a consequence, Heisenberg uncertainty relations impose limitations in the precision of 
their simultaneous measurement so that the notion of "point in phase space" loses any opera- 
tional meaning. Taking into account General Relativity, one can show by simple arguments that 
not only phase-space coordinates but also space-time coordinates should be non-commutative 
(cf. [17, 18] and references therein) making the concept of "points in space-time" also problem- 
atic. Noncommutative Geometry provides efficient tools to study these "spaces" that are no 
longer described by a commutative algebra of coordinate functions, but by some noncommutative 
operator algebra A. 

Losing the notion of points, one also loses the notion of distance between points. However 
one can still define a distance between states of the algebra A, for example with the help of 



*This paper is a contribution to the Special Issue "Noncommutative Spaces and Fields". The full collection is 
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a generalized Dirac operator D. The latter is the starting point of Connes theory of spectral 
triples [12], which is the datum {A, 71,0) of an involutive (non necessarily commutative) algeb- 
ra A, a representation vr of ^ as bounded operators on a Hilbert space 7i and a self-adjoint 
operator D, such that [D, a] is bounded and a{D — X)~^ is compact for any a £ A and A ^ Sp(D) 
(where the symbol vr is omitted) . The spectral distance between two states ipi , ip2 of A is defined 
as [10, 13] 

dD{^i,ip2) = sup{\ipi{a) - ip2{a)\; || [D, a]| |op < l}, (1.1) 

where the norm is the operator norm coming from the representation of ^ on 7^. It is easy to 
check that (1.1) defines a distance in a strict mathematical sense except that it may be infinite. 

Recall that a state is by definition a positive linear application A ^ C with norm 1, where A is 
the C*-algebra completion of A. When A is the algebra of observables of a physical system, this 
notion of state coincides with physicist's intuition, namely density matrices or Gibbs states in 
statistical physics, state vectors or wave functions in quantum mechanics. For the commutative 
algebra Cq^(A^) - i.e. complex smooth functions vanishing at infinity on some spin manifold A4 
of dimension^ m - a canonical spectral triple is 

{C^iM),L\M,S),D), (1.2) 

where L'^{Ai,S) is the Hilbert space of square integrable spinors on which C^{Ai) acts by point- 
wise multiplication and D is the Dirac operator. The latest is given in local coordinates by 



m 



where = d^ + uj^ is the covariant derivative associated to the spin connection 1-form oj^^ and 
{7^}/i=i,2,...,m are the self-adjoint Dirac gamma matrices satisfying 

T^T*" + 7 V = 25^''I (1.3) 

with g the Riemannian metric of M and I the identity matrix of dimension 2" if m = 2n or 
2n + 1. In this case A = Co(A^) and the spectral distance between pure states (i.e. states that 
cannot be written as a convex sum of other states), which by Gelfand transform are in one-to-one 
correspondence with the points of the manifold 

X ^ 4 : 4(/) = /(x) V/GCo(A1), (1.4) 

coincides [15] with the geodesic distance d of 7W, 

dD{Sx,Sy) = d{x,y) yx,y£M. (1.5) 

In a noncommutative framework it is tempting - inspired by (1.4) - to take the set V^A) 
of pure states of A as the noncommutative analogue of points and d/j as natural generalization 
of the geodesic distance. This idea has been tested in several examples inspired by physics: 
finite-dimensional algebras [5, 26], functions on Ai with value in a matrix algebra [35, 32, 33] 
encoding the inner structure of the space-time of the standard model of particles physics [8], 
non-commutative deformations of C^{Ai) [7]. Most often the computation of the supremum 
in (1.1) is quite involved, however several explicit results have been obtained. They all indicate 
that as soon as A is noncommutative one looses an important feature of the geodesic distance. 



^Unless otherwise specified, in all the paper we assume that "spin manifolds" are Riemannian, finite- 
dimensional, connected, complete, without boundary. 
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namely V{A) is no longer a path metric space [25]. Explicitly there is no curve [0, 1] 3 t ipt G 
V{A) such that 

doi^Ps^Vt) = \t - s\dDi^Po,^i), (1.6) 
not even a sequence of such curves ip"' such that 

dnifo, = Inf {length of ip"' between ipQ and ipi} . 

n 

This can be seen on a simple noncommutative examples studied in [26], based on ^ = M2(C) 
acting on with D = D* £ M2(C). V{M2{C)) is weak* homeomorphic to the Euchdean 2- 
sphere. The image under this homeomorphism of the pure states coi = {tpi, .ipi), i = 1,2, defined 
by the eigenvectors tpi of D are antipodal, and determine a distinguished vertical axis on S'^. 
The spectral distance is invariant by rotation around this axis and the connected components 
(see (2.16) below) are circles parallel to the horizontal plane. Explicitly, on the circle with radius 
r G [0, 1] one finds 



2r 

dDi0i,92) 



LDi - Do 



sm ■ 



(1.7) 



where 9 £ [0, 2tt[ is the azimuth and Di are the eigenvalues of D. One then checks^ that (1.6) has 
no solution in V{M2{C)) [34]. The lack of geodesic curves (1.6) within V{A) is cured by conside- 
ring non-pure states. Indeed (1.1) not only generalizes the geodesic distance to noncommutative 
algebras, it also extends the distance to objects that are not equivalent to points, namely non- 
pure states. Noticing that (1.7) is the geodesic distance within the Euclidean disk of radius 
[j^^jj^l 'iiid that the set 5(M2(C)) of states of M2(C) is homeomorphic to the 2-dimensional 
Euclidean ball (see Section 4.1), one easily obtains a curve in 5(M2(C)) satisfying (1.6), namely 

(pt = {I- t)ipo + tipi. 

This remains true in full generality since, whatever algebra A and operator D, 

doi^s^'Pt) = sup {|(s - t){'pQ - 'pi){a)\; ||[D,a]||op < l} = |s - t\dD{p>o,^pi). (1.9) 

aG.4 

In other terms, in view of (1.9) and (1.5), the spectral distance is a natural generalization of 
the geodesic distance to the noncommutative setting as soon as one takes into account the whole 
space of states, and not only its extremal points. This motivates the study of the spectral distance 
between non-pure states that we undertake in this paper. We begin by giving a detailed proof 
of Rieffel's remark [39] (also mentioned in [4]), according to which in the commutative case A = 
C^{Ai) the spectral distance coincides with a distance well known in optimal transport theory, 
namely the Wasserstein distance W of order 1 (see the bibliographic notes of [44, Chapter 6]). 
We stress in particular that M has to be complete for that result to hold besides the compact 
case. Then we present few explicit calculations of do between non-pure states, and question 
on simple examples - including the standard model - the pertinence of the optimal transport 
interpretation of the spectral distance in a noncommutative framework. These are preliminaries 



curve 1 1-> 9{t) satisfies (1.6) if and only if 



^K\t-s\ (1.8) 



for any t,s £ [0, 1] and K a eonstant. The right hand side of (1.8) being a function of t — s, there exists a function / 



such that the left hand side is f{t ~ s). Putting s — 0, one obtains f{t) = | sin ■ ' \. Reinserted in (1.8), this 

yields Cauchy's functional equation 6{t)—6{s) — 6{t~ s) ~ 6(0), whose continuous solutions are linear. Since (1.8) 
has no linear solutions, this proves the claim. 
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results, intending to bring the attention of the transport theory community on the metric aspect 
of noncommutative geometry, and vice-versa. 

Notice that some properties of the spectral distance between non-pure states have been 
investigated in [39]: considering instead of ||[D,a]||op an arbitrary semi-norm on A, it is shown 
that the knowledge of the distance between pure states of a noncommutative A may not be 
enough to recover the semi-norm on A. One also needs the distance between non-pure states. 
This suggests that the metric information encoded in formula (1.1) is not exhausted once one 
knows the distance between pure states. 

The plan of the paper is the following. In Section 2 we recall some basics of transport 
theory and noncommutative geometry in order to establish - in Proposition 2.1 - the equality 
between d^) and W for any complete spin manifold M. We also discuss various definitions 
of the spectral distance, characterize its connected components and emphasize the importance 
of the completeness condition. In Section 3 we provide some lower and upper bounds for the 
distance. Specializing to M = M", we explicitly compute the distance between a class of states 
generalizing Gaussian wave-packets. Section 4 deals with noncommutative examples. We show 
that on the truncations of the Moyal plane introduced in [7], the Wasserstein distance Wd with 
cost do defined on V{A) does not coincide with the spectral distance on S{A). But on almost- 
commutative geometries, including the standard model of elementary particles, the spectral 
distance between certain classes of states may be recovered as a Wasserstein distance W with 
cost d' defined on a subset of S{A) containing P(^). We also point out a reformulation of the 
spectral distance on V{A) in term of the minimal work Wj associated to a cost c/ non- vanishing 
on the diagonal (i.e. cj{x,x) / 0). 



2 Spectral distance as Wasserstein distance of order 1 
2.1 Spectral distance from Kantorovich duality 

For any locally compact Hausdorff topological space X, states ip £ S{Co{X)) are given by Borel 
probability measures ^ on X, via the rule 

ipif)= [ fdfi V/G A (2.1) 

This is a simple application of Riesz representation and Hahn-Banach theorems together with 
the assumption that X is cr-compact in order to avoid regularity problems. Any such ^ defines 
a state since / vanishing at infinity (hence being bounded) guarantees that (2.1) is finite. 
Pure states correspond to Dirac-delta measures. To provide some physical intuition, one can 
view ip as a wave-packet and imagine that it describes the probability distribution of a bunch 
of particles. Strictly speaking a wave-packet is a square root of the Radon-Nikodym derivative 
of d/x with respect to some fixed cr-finite positive measure dx on X (it is a square-integrable 
function, almost everywhere defined and unique modulo a phase, whenever d/i is absolutely 
continuous with respect to dx). For instance in quantum mechanics, with X = M" and dx 
the Lebesgue measure, a wave-packet is a function cj) £ L'^{W^) and the corresponding measure 
is d/x = |(/)(x)pdx. 

Assuming ^ is a metric space with distance function d, there is a natural way to measure 
how much two states (pi and 922 differ, which is the expectation value of the distance between 
the two corresponding distributions 

E{d; fii X fj,2) = (i(x,y)d/ui(x)d/i2(y), 
JxxX 

where fii is associated to cpi via (2.1). Other ways are suggested by transport theory (all material 
here is taken from [1, 43, 44, 6], where an extensive bibliography can be found. Following [43] 
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we assume from now on that X has a countable basis, so that it is a Polish space). Assume there 
exists a positive real function c{x, y) - the "cost function" - that represents the work needed to 
move from x to y. A good measure on how much the ipi's differ is given by the minimal work W 
required to move the bunch of particles from the configuration ipi to the configuration ip2, 
namely 

W{ipi,ip2) = mf [ cix,y)d7T, (2.2) 

where the infimum is over all measures n on X x X with marginals ^i, fi2 (i-e. the push- 
forwards of vr through the projections \,Y : X x X ^ X, X(x,y) = x, Y{x,y) = y, are 
X*(7r) = /ii and Y*(7r) = /U2). Such measures are called transportation plans. Finding the 
optimal transportation plan, that is the one which minimizes W, is a non-trivial question known 
as the Monge-Kantorovich problem. This is a generalization of Monge [36] "deblais et remblais" 
problem, where one considers only those transportation plans that are supported on the graph 
of a transportation map, i.e. a map T : X ^ X such that T*^i = /X2. Namely, 

W^Monge('/C'i,<^2) = inf / c{x ,T{x))dni{x) . (2.3) 
Jx 

One of the interests of Kantorovich's generalization [28] is that the infimum in (2.3) is not always 
a minimum: an optimal transportation map may not exist. On the contrary the infimum in (2.2) 
is a minimum and always coincides with Monge infimum, even when the optimal transportation 
map does not exist. Moreover when the cost function c is a distance d, (2.2) is in fact a distance 
on the space of states - with the infinite value allowed - called the Kantorovich-Rubinstein 
distance (this case was first studied in [29]). To be sure it remains finite (see (2.13) below), it 
is convenient to restrict to the set Si{Co{X)) of states whose moment of order 1 is finite, that 
is those distributions fi such that 

E{d{xo,o); n) = / d{xQ,x)dfj,{x) < +00, (2.4) 
Jx 

where xq is an arbitrary but fixed point in X. Note that as soon as K{d{xQ, o); fj.) is finite for xq, 
then by the triangle inequality K{d{x,o); n) < E((i(xo, o); /i) + d{xo,x) is finite for any x £ X 
so that Si{Co{X)) is independent on the choice of xq. The Kantorovich-Rubinstein distance is 
also known as the Wasserstein distance of order 1. The distance of order p is given by a similar 
formula with E((i^'; //i (8) ijl2Y^^i 1 < P < 00, but in this paper we are interested only in the 
distance of order one and we shall simply call W the Wasserstein distance. 

The link with non-commutative geometry, which seems to have been first noted for M com- 
pact in [39], is the following: when X \s a, spin manifold A^, the Wasserstein distance with 
cost function the geodesic distance d is nothing but the spectral distance (1.1) associated to 
the canonical spectral triple (1.2). This is a priori not difficult to see: On one side a central 
result of transport theory, Kantorovich duality, provides a dual formulation of the Wasserstein 
distance as a supremum instead of an infimum, namely (cf. Theorem 5.10 and equation (5.11) 
of [44]) 

W{^i,^2)= sup ( [ fdfii- [ fdfii) (2.5) 

ll/llLip<i,/6Li(m)nLMM2) V^A- Jx J 

for any pair of states in S'(Co(<^)) such that the right-hand side in the above expression is finite. 
The supremum is on all real [ii=\ 2-integrable functions / that are 1-Lipschitz, that is to say 



\f{x)- f{y)\<d{x,y) Vx,yE^. 
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On the other side, the commutator —i['y'^df^,f] (where we use Einstein summation convention 
and sum over repeated indices) acts on L'^{A4,S) as multiphcation by —i'y^d^f. Moreover the 
supremum in (1.1) can be searched on self-adjoint elements [26], that for A = Cq°(A^) simply 
means real functions /. Thus 

ii[^,/]iiop = = \\{7'd,f)irdj)\\o^ = \\urr+Yind,fduf\\or> 

I Lip 



"""d^fdJllop = Wg^'d^fdJW^ = ||/||2 (2.6) 



where we used: the C*-property of the norm, that d^f commutes with duf and the 7's matri- 
ces, equation (1.3), that ^ g^^d^j,fdi,f evaluated at x is the norm of the gradient V/(x) and 
suPxeA^ ll^/(2;)llTa;Af = 1 1 /I kip by Cauchy's mean value theorem. Consequently the commu- 
tator-norm condition in the spectral distance formula yields on / the condition required in 
Kantorovich's dual formula. However one has to be careful that, although the (^j's on the l.h.s. 
of (2.5) denote states of C^{M), the supremum on the r.h.s. includes functions non-vanishing 
at infinity. Therefore (1.1) equals (2.5) if and only if the supremum on 1-Lipschitz smooth 
functions vanishing at infinity is the same as the supremum on 1-Lipschitz continuous functions 
non- necessarily vanishing at infinity. That Cg°(7W) is dense within Co(A^) is well known, but 
it might be less known that continuous i^-Lipschitz functions can be approximated by smooth 
X-Lipschitz functions. In the following we use this result - proved for finite-dimensional man- 
ifolds in [24] and extended to infinite dimension in [2] - in order to to prove Rieffel's remark, 
generalized to complete locally compact manifolds (e.g. complete finite-dimensional manifolds). 

Proposition 2.1. For any (pi,f2 £ '5(C^(A^)) with M a {complete, Riemannian, finite di- 
mensional, connected, without boundary) spin manifold, one has 

W{ipi,ip2) = dDi^l,^P2)- 

Proof, i) It is well known [2] that on M" X-Lipschitz functions are the uniform limit of smooth 
X-Lipschitz functions (for any K > 0). It may be less known that the same is true for any (finite- 
dimensional) Riemannian manifold A4, and for i^-Lipschitz functions vanishing at infinity. Let 
us give a short proof of this result. 

According to Theorem 1 in [2] (that is valid for separable Riemannian manifolds, so in 
particular for finite-dimensional manifolds), given a Lipschitz function /, for any continuous 
function e : — )• M"*" and for any r > there exists a smooth function g^^r : — )• M such that 

Ibe.rllLip < ||/||Lip + ?■ and \f{x) - ge,rix)\ < e{x) y X £ M. 

As a corollary, if / is a if-Lipschitz function vanishing at infinity, we can fix a sequence e„ of 
continuous functions vanishing at infinity and uniformly converging to zero, and a sequence of 
positive numbers r„ converging to zero in order to get a sequence of smooth functions g„ : — )• 
M such that 

\\gn\\up < K + rn and \f{x)-gn{x)\<£n{x) y X £ M. 

Obviously gn vanishes at infinity (since both / and e„ vanish at infinity). Let us call := 
K{K + rn)~^gn- The /n's are the required smooth iT-Lipschitz functions vanishing at infinity 
that converge uniformly to /. Indeed 



implies 



K r 
sup \f{x) - fn{x)\ < — — — sup |e„(x)| + -—^ — sup |/(x)|, 

X IS. -\- Tji X ~r X 
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and the right hand side goes to zero for n — oo since sup^. — )• and r„ — by assumption, 
while sup^ 1/(2^)1 is finite since / is continuous vanishing at infinity. 

ii) By (2.6) the supremum in (1.1) is on 1-Lipschitz smooth functions vanishing at infinity. 
By i) above any 1-Lipschitz function / in Co(A^) can be uniformly approximated by smooth 
1-Lipschitz functions /„ in C^{Ai). Since any state (p of Co{Ai) is continuous [3] with respect 
to the sup-norm, namely 

lim ||/-/n||oo=0 =^ lim (p{fn) = ^if), 

the supremum in (1.1) can be equivalently searched on continuous functions and the spectral 
distance writes 

dD{^Pi,V^2)= sup I / /d/ii - / /d/i2)- (2.7) 

/eCo(>!,IR),||/||Lip<l V./>! Jm J 

iii) In case M is compact Co(A^,M) coincides with C(A^,M) and (2.7) equals (2.5), hence 
the result. In case M. is only locally compact, Co(A^,M) C C(A^,M) so that dD{^i,f2) < 
W{ipi,ip2)- To get the equality, it is sufficient to show that to any 1-Lipschitz /ij-integrable 
function / G C(7W,M) is associated a sequence of functions /„ G Co(7W,M) such that 

ll/n||Lip<l (2.8) 

and 

lim A(/„) = A(/), (2.9) 

where A(/) = (pi{f) — ^2{f)- We claim that such a sequence is given by 

Ux) = /(x)e-'^(^O'^)/", n G N, (2.10) 

where xq is any fixed point. Indeed since M is complete, d{xQ^ x) diverges at infinity as explained 
in Section 2.3 below; by the 1-Lipschitz condition |/(x)| < |/(xo)| + d{xQ,x), and this proves 
that \fn{x)\ < (|/(xo)| + d(xo, x))e~'^(^"'^)/" vanishes at infinity. 

To obtain (2.8), one first notices that A(/ + C) = A(/) for any C G M, so that we can assume 
without loss of generality that /(xq) = 0, that is to say 

1/(^)1 = |/(x)-/(xo)|<a!(xo,x). 

Second, from V/„ = {V f — n~'^ fVd{xQ,o)^ e~'^^^'^'°^/'^ and remembering that both / and 
d(xo,o) are 1-Lipschitz, one gets 



|V/„| < (l + n-i4xo,o))e 



-n ^d{xQ,o) 



The inequality (2.8) then follows from (1 + ^)e"^ < 1 V ^ > 0. (2.9) comes from Lebesgue's 
dominated convergence theorem: |/n(x)| < \ f{x)\ V x, n and |/| is /ij integrable by hypothesis, 
so that lim J f^dfn = Jj^ fd^ii. ■ 

2.2 Alternative definitions 

There exist several equivalent formulations of the spectral distance. First one may consider 
continuous instead of smooth functions. Indeed, as explained in [11], for any measurable bounded 
function / one can view [D, f] as the bilinear form 



C,rj^{D^,f7j)-{rtD7]), 
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well defined on the domain of D (a dense subset of Therefore [D,f] makes sense 

also when / is not smooth and one can define [12] the spectral distance as the supremum on all 
continuous functions / S Co(M) with ||[-C',/]|| < 1, that is the set of 1-Lipschitz functions, ob- 
taining thus directly (2.7). In the literature one finds both definitions: supremum on continuous 
functions [12, 23] or on smooth functions [13, 16]. 

Second, one may be puzzled by the use of the spin structure to recover the Riemannian metric. 
In fact instead of the Dirac operator one can equivalently use, as explained in [14], the signature 
operator d + S acting on the Hilbert space L'^{j\4, A) of square-integrable differential forms, or 
the de-Rham Laplacian A = dS +d^d acting on L^(A^). Here d is the exterior derivative and d^ 
its adjoint with respect to the inner product [23] 

{uj,uj') = / {uj,uj')i'g y e l'^{M,a) 
Jm 

with Vg the volume form and the inner product on fc-form given by 

(dx"! A • • • A dx""" , dx^^ A • • • A dx^"' ) = 6kk' det (c/"'^^ ) l<i,j <k. 

The action of both operators only depends on the Riemannian structure and suitable commu- 
tators yield on self-adjoint elements / = /* in Cq°(A^) the same semi- norm as the commutator 
with D. Exphcitly^ 

||[(i + dt,/]2||„p= i||[[A,/],/]||„p = ||[Z),/]||2p. (2.11) 

To show these equalities, let us note that on L^(7W, A), [d, f] = e{df) where e denotes the wedge 
multiplication, 

e{df)ijj = df Auj. 

Therefore [d, f] commutes with 0-form, and the same is true for its adjoint [(i, /]^. Assuming 
/ = /*, that is [d^,/] = —[d,f]^, few manipulations with commutators yield 

[[A,/],/] = [[d,f]d\f] + [d[d\f],f] + [[d\f]d,f] + [d^[d,f],f] (2.12a) 
= 2[d,f][d\f] + 2[d\f][dJ] (2.12b) 
= 2[d + dt,/]2, (2.12c) 

where [d, /]^ = [d^, /]^ = due to the graded commutativity of the wedge product. Remembering 
that e{dxf^)^ = g^'^ i{dy) where l is the contraction (see e.g. [23]), one gets [d, /]^ = ^{df)'^ = 
{{d^f)e{dxt'))^ = {gt"'d^f)L{d^) so that (2.12b) becomes 

[[A, /],/] = -2d,fig^''d,f) {e{dxP)i{d^) + i{d^)e{dxP)) = -2g''Pdpfd,f, 

where we used the fact that e{dxP)i{d^,) + i{d^)e{dxP) = 5^. With (2.12c) this shows that 
[d + d\ f]'^ = ^ [[A, /] , /] is the operator of point-wise multiplication by the function g^'^dufd^f, 
whose operator norm is \\gP''^d^fduf\\oo- (2-11) then follows from (2.6). 

Notice that instead of the Laplacian one could use any other 2nd order differential operator 
with the same principal symbol. 



^Although we use the same symbol, one should keep in mind that the three operator norms in (2.11) correspond 
to actions on different Hilbert spaces: L'^{M,A), L'^{M), L'^{M,S). 
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2.3 On the importance of being complete 

At point iii in the proof of Proposition 2.1 it is crucial that M is complete. By the Hopf-Rinow 
theorem a complete finite-dimensional Riemannian manifold is a proper metric space, hence the 
geodesic distance from any fixed point xq, x i— )• (i(xo,x), is a proper map [42]. In particular for 
non-compact Ai this means that d{xQ,x) diverges at infinity, so that the functions /„ in (2.10) 
vanish at infinity. 

When M is not complete, not only the /„ do not vanish at infinity which spoils the proof, 
but also the definition of the Wasserstein distance requires more attention. Indeed in [43] 
Kantorovich duality is proved assuming X is complete. It is not clear to the authors whether 
the duality holds in the non-complete case. However one can still take (2.5) as a definition 
of W , letting aside whether this is the same quantity as (2.2) or not. Then it is easy to see 
that on non-complete M. the spectral distance and W are not necessarily equal. Suppose indeed 
that N is compact, and M is obtained from M by removing a point xq, so that Ai is locally 
compact and not complete. The spectral distances d/^ and d-^ computed on M and M are equal. 
Indeed by (2.7), which holds also in the non-complete case, we can compute as supremum 
over continuous functions instead of smooth ones; in the computation of the spectral distance 
on AA, it is equivalent to take the supremum over 1-Lipschitz / G C{M) or over 1-Lipschitz 
/' = f — f{xo) S Ci^M) vanishing at xq (since v?i(/) — ^2{f) = ^lif) — ^2{f') for any two states 
fi,V2)', therefore 

dD{^i,^2)= sup {v'i(/) - (/?2(/); ll/llLip < 1} 
/eC{A^) 

sup {Mf)-Mf);\\f\\up<-^} = d^iipi,cp2), 

feC{Af),f(xo)=0 

where in last equality we used Co{M) = {/ G C{Af), /(xq) = 0}. 

On the contrary, W computed between pure states coincide with the geodesic distance, and 
the latest may or may not be the same on J\f and Ai. For instance one does not modify the 
geodesic distance by removing a point from the two sphere. But taking for M the circle, thought 
of as the closed interval [0, 1] with and 1 identified, and Ai the open interval (0, 1), one gets 
Wf>j-{x, y) = min{|a; — y\,l — \x — y\} whereas Wj^ix, y) = \x — y\. Thus on = (0, 1) 

d^{x,y) = d^{x,y) = WM{x,y) + WM{x,y). 

2.4 On the hypothesis of finite moment of order 1 

Restricting to the states with finite moment of order one (cf. (2.4)) guarantees that Wasserstein 
distance is finite, since by (2.5) 

W{ipi,Lp2)= sup (/ {f{x) - f{xo))d^i{x) - {f{x) - f{xo))d^2ix) 

||/||Lip<l \JX JX 



< sup / ]/(x) - /(xo)|d;Ui(x) + sup / ]/(x) - /(xo)|d/i2(x) 

ll/l|Lip<l^A' \\f\\Lip<lJX 



< / (i(x, xo)d/ii(x) + / d(x, xo)d/i2(x) < oo. (2-13) 

Jx JX 

An obvious upper-bound is then obtained by choosing vr = x /i2 in (2.2): 

doivi, ^2) = W{^i,^2) < IE(d; m X /i2). (2.14) 

When at least one of the states is pure, this upper bound is an exact result, even outside 
Si{C^{M)). 
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Proposition 2.2. For any x £ A4 and any state ip G 5(Co°(A^)), 

dDi^p,6x) = W{ip,dx) = E{d{x,o); fi). 
Proof. By Proposition 2.1, 



dD{<f,Sx) = W{(f,6x)= sup ( / /d//-/(x)) 

ll/llLip<i,/eLi(M) \JM J 

sup (/ (/(y) - /(x)) d/x(y) 
,<i,/eLi(M) \JM 



ll/l|Lip<i,/eLi(M) 

< sup / |/(x)-/(y)|dMy) 

ll/llLip<l,/GLi(M) JM 

< / d{x,y)dfi{y) = E{d{x,o);fi). 
JM 

This supremum is attained on the 1-Lipschitz functions f{y) = d{x,y) in case fj, G 5i(C^(A^)), 
or is obtained by the sequence fn as defined in (2.10) in case ^ has not a finite moment of 
order 1. ■ 

Notice that this proposition does not rely on the finiteness of W nor do, and makes sense 
since K(^d{x, o); fij is either infinite or convergent, the integrand in (2.4) being a positive function. 
When the distributions are both localized around two points, x, y, transportation maps are 
simply paths from x to y and the minimal work coincides with the cost c(x, y) to move from x 
to y, i.e. with the geodesic distance. In other words 

dDi6,,5y) = W{6,,6y) = dix,y) (2.15) 

and one retrieves (1.5). Proposition 2.2 also yields an alternative definition of Si{C^{Ai)). 

Corollary 2.1. ip G Si{C^{M)) if and only if ip is at finite spectral-Wasserstein distance from 
any pure state. 

Proof. By the triangle inequality, the moment of order one of (p is finite for a fixed x = xq if 
and only if it is finite for all x G A^. ■ 

Let us conclude this section with some topological remarks. 

Definition 2.1. Given an arbitrary spectral triple {AjT-LjD), we call [33] 

Con((/?) = {if' G S{A), d£){ip,ip') < +oo}. 

Notice that connected components in S{A) for the topology induced hy do coincide with sets 
of states at finite distance from each other, thus justifying the name Con{ip). Indeed Con{ip) is 
path-connected since for any ipo,(pi G Con((/?) the map 

[0, l]3t^ipt = {l- t)(po + tipi G Con{ip) 

is continuous (for all e > called 6^ = e/d{ipo, ipi) from (1.9) we get \t—s\ < 5e =^ d£i{ipt, (ps) < e)- 
That Con((/?) is maximal - i.e. there is no connected component containing it properly - can be 
easily seen: for any ip' G Con((/j) any open ball centered at tp' is contained in Con{ip), so that 
Con((/9) is open; by the triangle inequality the same is true for the complementary set, proving 
that Con((/3) is also closed. Therefore, any set containing properly Con(99) is not connected since 
it contains a subset that is both open and closed. 

For the Wasserstein distance, the set of states with finite moment of order one is a connected 
component. 
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Corollary 2.2. For any if G Si{C^{M)), Con((p) = Si{C^{M)). 
Proof. Let ip £ Si{C^{M)). By the triangle inequality, 

is finite by Corollary 2.1 as soon as ip' £ Si{C^{Ai)). Similarly 

dni^x, f') < dD{5x, (f) + doi^p, f') 
implies that d£){{p,{p') is infinite when ip' ^ Si{C^{M)). ■ 
Notice that considering only pure states, the set 



is not necessarily (path)-connected in V{A). In the example (1.7) Con{(p) is indeed a connected 
component of V{A); but in the standard model (see Section 4.2) Con{ip) = ViA) and contains 
two disjoint connected components {M.,6c), (A^,5h)- 

3 Bounds for the distance and (partial) explicit results 

Having in mind that noncommutative geometry furnishes a description of the full standard model 
of the electro- weak and strong interactions minimally coupled to Euclidean general relativity [8] , 
computing the spectral distance could be a way to obtain a "picture" of spacetime at the 
scale of unification. Regarding pure states, this picture has been worked out in [35] and is 
recalled in Section 4: one finds that the connected component of d^ is the disjoint union of 
two copies of a spin manifold M, with distance between the copies coming from the Higgs field. 
Extending this picture to non-pure states is far from trivial since already on Euclidean space 
explicit computations of the Wasserstein distance are very few. This does not seem to be the 
most interesting issue in optimal transport, where one is rather interesting in determining the 
optimal plan than computing W. On the contrary from our perspective computing do is of 
most interest, while finding the optimal plan (i.e. the element that reaches the supremum in 
the distance formula) is not an aim in itself. In this section, we collect various results on the 
Wasserstein-spectral distance in the commutative case A = C^{A4): upper and lower bounds 
for the distance on any spin manifold Ai, and explicit result for a certain class of states in case 
Ai = M". Some of these results might be known from optimal transport theory, but we believe 
it is still interesting to present them from our perspective. 

We postpone to the next section a discussion of the noncommutative case. In all this section, 
do = W and to avoid repetition we simply call it the spectral distance. 

3.1 Upper and lower bounds on any spin manifold 

On a spin manifold, a lower bound on the spectral distance between states with finite moment 
of order 1 is given by the distance between their mean points. For Ai = the mean point of 
ip G Si{Cq^ {M.)) is the barycenter of ^, namely 



Con((^) = Con{ip)r\V{A), 



ip G V{A) 





with 





where 



X 



m 



— >• 



X !->• x"(a;) 
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with a = 1, . . . ,m denote a set of Cartesian coordinate functions on M"^. For Ai that is not the 
Euchdean space, the mean point can be defined through Nash embedding, that is an isometric 
embedding of M. onto a subset of the Euchdean space M" for some n > m [37, 38]. We say 
that the embedding : — >• is convex if Al is a convex subset of M". In that case, the 
barycenter 5; of /i = A,,/x is in Al, and 

X = N~^{x) G M 

is a weU defined generahzation of (3.1) to AJ. 

Before showing in Proposition 3.1 that the distance between mean points bounds from below 
the spectral distance, let us recall two properties of Nash embedding that will be useful in the 
following. 

Lemma 3.1. Let M. he a Riemannian manifold admitting a convex isometric embedding N : 
Al ^ A( C M". Then 

d{x,y) = \N{x)-N{y)\, (3.2) 

where \ ■ \ is the Euclidean distance on M". Moreover, if f is a 1-Lipschitz function on the 
metric space {M-,d) then f = f ° is l-Lipschitz on {A4, \ • |). 

Proof. An isometry A : Al — )• Ai preserves the distance function and sends geodesies to 
geodesies (cf. e.g. [9, page 61]). If Al is convex, since the metric on A4 is the restriction of the 
Euclidean metric of M", geodesies are straight lines and the distance is the Euclidean one. This 
proves (3.2). 

As a consequence, if / is a 1-Lipschitz function on A4, for any x,y G AJ we have 
\f{x)-f{y)\ = \f{x)-f{y)\<d{x,y) = \x-y\, 
where x = N^^{x) and y = N^^{y). This means that / is 1-Lipschitz on A(. ■ 

Proposition 3.1. Let Ai be a Riemannian spin manifold that admits a convex isometric em- 
bedding M ^ M". For any states ipi, ip2 in S{Cq{M)) with mean points xi, X2, 

d{xi,X2) < doi^i, ip2) < E,{d; m x 112) . 

Proof. (2.14) holds true, so we need to prove only the lower bound. Let us fix a basis of the 
vector space M" such that 

ii - ^2 = (1^1 - ^2|,0, . . . ,0), (3.3) 

where Xi = N{xi), i = 1,2, has Cartesian coordinates 

x^ d/ij, /3 = 1, . . . , n. 



IM 

(3.3) is equivalent to 



/.'d^,-/ .MA. = |%-i.l and /L.*dA,-/ .*d^. = for ^.[2.„1 

Jm Jm Jm Jm 

Therefore 



\Xl - X2\ 



/ xM/ii- / xM/i2 < sup / /d/ii- lfdiJ-2 
Jm Jm \\f\\i^,^<, Jm Jm 
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where we noticed that x 



is Lipschitz in M with constant 1 since 



X 



\x) - x\y) < J (xi(x) - x\y)f + ^.^^^ {x'^i.x) - y^{x)f 



= \x- y\. 




Proposition fohows from (3.2), i.e. \xi — X2I = d{xi,X2)- 



Note that when M does not admit a convex embedding. Proposition 3.1 still holds with 
\xi — X2I instead of d[xi,X2)- However this might not be the most interesting lower bound since 
it involves a distance on M" that is not the push-forward of the one on M (the points Xi may 
not be in A^, and the Euclidean distance, even when restricted to A^, is not the push- forward 
of the geodesic distance on A^). 

3.2 Spectral distance in the Euclidean space 

In this section we study the spectral distance between states of 5i(Co°(A^)) in the case M. = M™". 
To a given density probability ^ S L^{W^) with finite moment of order 1, e.g. a Gaussian 
'(/'(x) = 7r~ 2"e~l^l , one can associate a state ^a,x given by 



for any x € and a £ M"*". This becomes the pure state (the point) 6x in the a — )• 0"*" limit 
since lim TpcT,x{f) = f{x). In this sense '^a,x can be viewed as a "fuzzy" point, that is to say 



a wave-packet - characterized by a shape ■0 and a width a - describing the uncertainty in the 
localization around the point x. 

The spectral distance between wave packets with the same shape is easily calculated. 

Proposition 3.2. The distance between two states "^a,x o,nd "^a'^y is 




cr-S>0+ 




(3.4) 



In particular, for a = a' 



■' the distance does not depend on the shape tp: 



dD{'^u,x,'^<T,y) = \x-y 



Proof. Since f{z) — f{w) < \z — w\ for any 1-Lipschitz function /, we have 




When a = a' the upper bound is attained by the function 



h{z) = z ■ 



x-y 



x-y 
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X+a| 




Y+ o'l 




Figure 1. The function h that attains the supremum, in case a > a' and a 
to the gradient of h. 



a' . Dot hnes are tangent 



while for a ^ a' it is attained by the function 

h{z) = \z — a\ , 

where 

. a'x — ay 



a 

a' — a 

as shown in Fig. 1. 



(3.5) 



In case a ^ a' the function h that attains the supremum measures the geodesic distance 
between z G M"* and the point a defined in (3.5). GeometricaUy the latest is the intersection 
of (x,y) with (x + cr^,y + o"'^), and is independent of ^. In case a = a' , a is send to infinity 
and h measures the length of the projection of z on the (x, y) axis (see Fig. 1). The picture is 
still valid for pure states, i.e. a = a' = 0: h can be taken either as the geodesic distance to any 
point on {x,y) outside the segment [x,y], or as the distance to any axis perpendicular to {x,y) 
that does not intersect [x,y]. 

For Gaussian shape, Proposition 3.2 can be confronted with the Wasserstein distance of 
order 2 between Gaussians computed in [21]. The final expression for the distance of order 2 
is quite simpler, since it is an algebraic function of the means and covariance matrices. Here, 
computing the integral in (3.4) for ip a Gaussian with the help of a symbolic computation 
software, one finds complicated expressions involving Bessel functions. 

The distance between two arbitrary states on M™ is less easily computable. Let us first recall 
what happens in the one dimensional case (cf. e.g. [19]). 

Proposition 3.3. For any two states fi and (p2, whose corresponding probability measures 
and fj,2 have no singular continuous part, 



+00 



(d/ii(o-d/i2(e)) 



dz. 



Proof. Integrating by parts one finds 



/'(z)A(z)dz, 
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where 

A{z)= r {d^,,{0-d^^2{0) (3.6) 

is the cumulative distribution of the measm'e /Ui — fi2- We are assuming that /Xj are the sum of 
a pure point part and a part that is absolutely continuous with respect to the Lebesgue measure. 
In this case A{z) is piecewise continuous, its sign is a piecewise continuous function, and the 
primitive h{z) of the sign of A(z) is a Lipschitz continuous function. Now, the 1-Lipschitz 
condition says that a.e. |/'(^;)| < 1 on M, hence 

dD{fi,'P2) < / \A{z)\dz. 

JR 

This upper bound is attained by any 1-Lipschitz function h such that 

h'{z) = 1 when A{z) > 0, h'(z) = —1 otherwise. 
By the above consideration, such a 1-Lipschitz function (at least one) always exists. ■ 

In a quantum context, one can view the cumulative distributions Cj = J^d^i{£,) as the 
probability to find the particle on the half-line (— oo, z] before the transport (i = 1) or after the 
transport (i = 2). A(z) in (3.6) measures the probability flow across z, and the Wasserstein 
distance is the integral on M of the modulus of this probability flow. 

On M™ with m > 1 there is no such an explicit result. If ipi, 1/^2 are bounded Lipschitz 
functions with compact support, we know from [19] that there exist two (a.e. unique) bounded 
measurable Lipschitz functions a,u : — )• M such that a > 0, 

— V(aVn) = ipi — ip2 

in the weak sense, and |Vu| = 1 almost everywhere on the set where o > 0. We have then 

dD{(pi,^2) = / a{x)dx. 
Indeed integrating by parts, we can write 

Mf)-^2{f)= I (V/)-(aVu)dx 
and since \aVu\ = a we have 

dD{(pi,^2) < / a{x)dx . 
The sup is attained by the function f = u. 

4 Noncommutative examples 

At the light of Proposition 2.1 one may wonder if the analogy between the spectral and the 
Wasserstein distances still makes sense in a non-commutative framework. In other terms, for A 
noncommutative is the distance on S{A) computed by (1.1) related to some Wasserstein dis- 
tance? 

The most obvious answer, based on Gelfand's identification (1.4) between points and pure 
states, would be to consider the Wasserstein distance Wd on the metric space {V{A), do), and 
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question whether Wd on the set Prob(T'(^)) of probabihties distributions on Vi^A) coincides 
with do on S{A). This is obviously true for pure states since by (2.15) 

WDioJi,uj2) = c{u}i,uj2) = d£)(a;i,a;2) V uJi,u!2 G ViA). 

For non-pure states however this is usuahy not true. The reason is that even if 

S{A) C Prob(P(^)) 

for commutative and almost commutative C*-algebras (see the definition in the next paragraph), 
there is not a 1-to-l correspondence between the two sets (except in the commutative case). For 
instance, as recalled in Section 4.1, 5(M2(C)) is a non-trivial quotient of Prob(P(M2(C))), mea- 
ning that two probability distributions cpi / cp2 may give the same state Lfi = ip2. Thus, given 
a spectral triple on M2(C) as the one studied in [7], one has duCvi; V2) = while Woifpi, 4'2) / 
since the cost being a distance (and assuming V{A) is a polish space), is a distance and 
vanishes if and only if = (/>2. 

However this does not mean that the optimal transport interpretation of the spectral dis- 
tance loses all interest in the noncommutative framework. As explained in Section 4.2, there 
are interesting analogies between the spectral distance and some Wasserstein distances other 
than Wd in almost-commutative geometries. The latest are spectral triples {A,T-L,D) obtained 
as the product of a spin manifold by a finite-dimensional spectral triple {AijHi, Dj), namely 

A = C^iM)(E)Ai, n = L2iM,S)(E)ni, D = -ij^df, + -/""^^ (g) Dj, (4.1) 

where 1/ is the identity of T-Li and 7™+^ is the product of the gamma matrices in case m is even, 
or the identity in case m is odd. Ai being finite-dimensional (or, equivalently, C^{Ai) being 
Abelian) one has [27]: 

V{A) = V{C^{M)) X V{Ai). (4.2) 

For simplicity we discuss here the case Ai = acting on "H/ = C^, while in Section 4.2 we 
will focus on the finite-dimensional spectral triple describing the internal degrees of freedom 
(hence the subscript /) of the standard model of particle physics. Since has two pure states - 
^o{zo,zi) = zq, 6i{zo,zi) = zi V {zo,zi) e - from (4.2) 

ViA)=Mx {0,1} 

This is Connes' idea of "product of the continuum by the discrete" [11] seen at the level of pure 
states: through the product by a finite-dimensional spectral triple, the points of the manifold 
acquire a Z2 internal discrete structure, namely to a point x = 6x in the commutative case 
corresponds two pure states in ViA), 

xo = {5x,So), xi = {5x,Si). 

Moreover, although V{A) is the disjoint union of two copies of Ai, points on distinct copies need 
not to be at infinite spectral distance from one another. Indeed one shows that [35] 

dDixo,xi) = dDj{5o,6i), (4.3) 

where dDj denotes the spectral distance associated to [AijHi, Dj). Aj is represented on Hj 
by diagonal matrices but Dj G M2{C) needs not to be diagonal. Especially, if Dj has non-zero 
off-diagonal terms then ||[Z),a]|| < 1 is a non-trivial constraint which guarantees that (4.3) is 
finite. Note that if one takes the direct sum of spectral triples instead of a product, namely 
Cq^{M) © Cq°{M) acting on L2{M,S) © L2{M,S), one gets the same pure states as above but 



A View on Optimal Transport from Noncommutative Geometry 



17 



D = —ij^dfx(B —i^^d^ does not have off-diagonal terms and the two copies of M are at infinite 
distance from one another (points have not enough space to "talk to each other" through the 
off-diagonal terms of D). 

From an optimal transport point of view, the non-vanishing of (4.3) can be interpreted as 
the fact that "staying at a point", which is costless in the commutative case since c{x,x) = 
dnix^x) = 0, may have a cost 

dD{xo,xi)^0 (4.4) 

in almost-commutative geometries, corresponding to the "internal jump" from xq to Xi. We 
investigate this idea in Section 4.2, showing in Proposition 4.2 that the spectral distance between 
pure states is the minimal work Wj associated to a cost c/. 



4.1 The Moyal plane 

The Moyal algebra Ae is the noncommutative deformation of the non-unital Schwartz algebra 
5(M2) with point -wise product 

(/*5)(x) = ^ / dV^/(x + y)<7(x + z)e-^2,e-i. v/,5G5(m2), 
where yQ^^z = y^Q'^lz^ and 

J) 

with G M, 6* 7^ 0. In [7] we studied the spectral distance associated to the spectral triple built 
in [20] around the action of Ae on L2(M^) and the usual Dirac operator on M^. We found that 
the topology induced by the spectral distance on S{Aq) is not the weak*-topology (a condition 
required by Rieffel [39, 40, 41] in the unital case in order to define compact quantum metric 
spaces, and adapted by [30] to the non-unital case). However by viewing Ae as an algebra of 
infinite-dimensional matrices, we proposed some finite-dimensional truncations of the Moyal 
spectral triple, based on the algebra -/Vf„(C), n G N, that makes it a quantum metric space. 
Explicitly, for n = 2 P(M2(C)) is homeomorphic to the Euclidean 2-sphere, 

i G P(M2(C)) ^ xg = {x^,y^,z^) G S\ 

so that a non-pure state ip is determined by a probability distribution (/> on 5^. Its evaluation 

f{a) = j 0(x^) Tr(s^o) dx^ = Tr (yjyj 4>{'^£)s^d-x.^ V a G M2(C), 

with G M2(C) the support of the pure state //~^(x^) and dxg the SU{2) invariant measure 
on 5^, only depends on the barycenter of (j), 

= (S</.,2/0,y<^) with X0 := / 0(x^)xgdx^ 

and similar notation for y^, z^. With the equivalence relation (j) ^ (j)' x<^ = x^/ one gets 
that 

S{M2{C))=S{C{S^))/ ^ 
is homeomorphic to the Euclidean 2-ball: 
ip — ^ x^ G S . 
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Figure 2. The vertical plane containing x^, x^'. 



In [7] we computed 
(iz)(x<i,x^/) = 




cos a (i£;c(X(^, X0') when a < — , 

1 _ TT 

when a > 



1, 1 sm a 4 

where d£;c(x^,X0/) = |x0 — X(^/| is the Euclidean distance and a is the angle between the segment 
[x0,X0'] and the horizontal plane = const (see Fig. 2). 

The state space of the truncated Moyal algebra is not the space of distributions on 5^, but 
a quotient of it. Therefore for (j) ^ one has dD{f,<f') = while Wd{^,^') / 0. 

In this example there seems to be no Wasserstein distance naturally associated to the spectral 
distance. In the next section, we exhibit another noncommutative example for which there 
exists a Wasserstein distance W , with cost d' defined on a set bigger than ViA), that coincides 
with do for some non-pure states. 



4.2 The standard model 

The spectral triple describing the standard model of elementary particles is the product (4.1) of 
a manifold with the finite-dimensional algebra C©]HI©M3(C) acting on Tij = C^^. Dj is a 96 x 96 
matrix with entries the masses of the elementary fermions, the Cabibbo-Kobayashi-Maskawa 
matrix and the neutrino mixing- angles. The choice of the algebra is dictated by physics [8] (its 
unitaries group gives back the gauge group of the standard model) and 96 is the number of 
elementary fermions^. The pure states of M3(C) turn out to be at infinite distance from one 
another and from any other pure state [35] so that, from the metric point of view, the interesting 
part is the product {A, T-l, D) of a manifold by {Ai = C © H, Hj^ Dj). By (4.2) the pure-states 
space is 

ViA)=Mx {0,1} 

since both C and the algebra of quaternion HI have one single pure state^ 

5c{z) = ^{z) VzGC, (5h(/i) = ^Tr/i y h = ^ ^ G M, a, /3 G C. 

''(6 leptons + 6 quarks x 3 colors) x 2 chiralities = 48, to which are added 48 antiparticles. 

""H is a real, but not a complex algebra. Real C*-algebras are defined similarly to complex ones [22], except 
that one imposes that 1 + a*a is invertible - which comes as a consequence in the complex case. A state (p is then 
defined as a real, real-linear, positive form such that ip{T) = 1 and <^(a*) — ip{a). Hence H has only one state, 
given by half the trace. Viewing C and Cq°{M) as real C*-algebras, their states are obtained by taking the real 
part of their "usual" complex states. See [31, page 42]. 
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The space underlying the standard model is thus the disjoint union of two copies of the manifold, 
and with some computation one shows [35] that the spectral distance is finite on V{A) and 
coincides with the geodesic distance d' on the manifold Ai' = Ai xl, with I := [0, 1] the closed 
unit interval, with metric 

g'^'ix) 
\\Dj 



2 ), (4.5) 



where g is the Riemannian metric on M. Assuming that Ai is complete, and writing t G X 
the extra-coordinate, this can be restated as: the spectral distance between pure states in 
the standard model coincides with the Wasserstein distance W' on the metric space {Ai',d') 
restricted to the two hyperplanes t = 0, t = 1. This reformulation of the main result of [35] 
allows to determine the spectral distance between a certain class of non-pure states, namely 
those localized on one of the copies of Ai. 

Explicitly, S{A) is the set of couples of measures ip = {fi, u) on A^, normalized to 

d/x + / du = 1, 
M Jm 

whose evaluation on 

9 b 



' ' -b g 
where f,g,b are in C(f(7V(), is given by 



ifia) = / dfi + / d,^. 

Jm Jm 

Two states ipi, ip2 are localized on the same copy of Ai ii ipi = (0, z^i), = (0,z/2); or 
fi = (/^ijO), ip2 = {fJ'2,0). Such states can be viewed as elements of S{A), S{C^{Ai')) and 
S{C^{M)). 

Proposition 4.1. For two states ipi, ip2 localized on the same copy of Ai, 

dnifi, f2) = W{lpi,'^2) = W'{(pi,(p2). (4.6) 

Proof. To fix notation we assume that ipi = (0, ui), ip2 = (0, ^2) are localized on the 6m copy 
of Ai, that we write Ai^ and associate to the value t = 1 of the extra-parameter t G I. The 
evaluation of (^^i — on a G .A only depends on Re(g). Moreover for a = a*, ||[D,a]|| < 1 
implies that g is 1-Lipschitz [35, equation (16)], so that 

dDi^i,V>2) <W{ipi,^p2). (4.7) 

The equality is attained by considering a = g^ where g^ is the real 1-Lipschitz function 
that attains the supremum in the computation of W. Hence the first equation in (4.6). 

To show that W{ifi,ip2) = W {ipi, ip2), one first notices that (4.5) being block- diagonal 
implies that 1), (?/, 1)) = d{x,y). Hence the restriction g{y) = g'{y,l) on Aia of any 

1-Lipschitz function g' in C^{Ai') is 1-Lipschitz, 

\g'{x,to) - g'{y,ti)\ < d'{{x,to),{y,ti)) \g{x) - g{y)\ < d{x,y). 
Conversely to any 1-Lipschitz function g G C^{Ai) one associates 

g'{x,t)=g{x) yx£M,t£l 
which is 1-Lipschitz in C^{Ai') and takes the same value as g on Ai^.- Therefore 

sup Iviig') - 'P2{g')\ = sup \vi{g) - 'P2{g)\ 

9'eCg-{M'),\\g'\\up=i 9eCo-(M),||g||Lip=i 

which, together with (4.7), yields the result. ■ 



20 



F. D 'Andrea and P. Martinetti 



It is tempting to postulate that (/92) = W {ipi, Lp2) for states locaUzed on different 

copies, or states that are not locahzed on any copy (i.e. (p = (/u, z^) with fi, v both non-zero). 
This point is under investigation. 

A disturbing point in the computation of the spectral distance in the standard model is the 
appearance of a compact extra-dimension X while the internal structure is discrete. In [35] 
X came out more as a computational artifact than a requirement of the model. From the 
Wasserstein distance point of view, the introduction of the extra-dimension can be seen in the 
following proper inclusions: 

V{A) CM' C SiA). 

Namely W is associated to the metric space {Ai',d') which is bigger than V{A) (the points 
between the sheets are not pure states of A) and smaller than S{A) (non-pure states localized 
on a copy are not in M'). To get rid of the extra- dimension, one could consider the metric space 
{V{A),dD) with associated Wasserstein distance Wd, but this would be of poor interest since 
the definition of do = d' requires the knowledge of M'. Alternatively one could look for a cost 
defined solely on Ai. For states (pure or not) localized on the same copy, this cost is simply the 
geodesic distance on A^, as shown in Proposition 4.1. For pure states on distinct copies such 
a cost cj also exists and is given by 

ci{x, y) = y^cZ(x,y)2 + p^. (4.8) 

Note that c/ is not a distance since it does not vanish on the diagonal, 
c/(x,x) = ||-^ VxGA^, 

but gives back the jump-cost (4.4) (by (4.5), d{xQ,xi) = pj^)- However c/ satisfies the condi- 
tion required to proved Kantorovich's duality (namely [43] it is lower semicontinuous and satisfies 
cj{x,y) > a{x) +b{y) for some real-valued upper semicontinous /ij-integrable functions). Hence 
it makes sense to consider the minimal work Wi associated to cj, whose formula is given by (2.5) 
with the Lipschitz condition replaced by \ f{x) — f{y)\ < cj{x,y). 

Proposition 4.2. The spectral distance between pure states xq = {5x,6c), yi = {6y,Su) on 
distinct copies is 

doixQ^yi) = Wi{x,y). 

Proof. For pure states the minimal work is the cost itself: Wj{x, y) = ci{x, y) for any x,y £ Ai. 
Since doixo^yi) = d'{xQ,yi) as recalled above (4.5), the result follows if one proves that 

d'\xo, yi) = d^{x, y) + — (4.9) 

This has been shown in [35] but we briefly restate the argument here for sake of completeness. 
Let us write = {x^^ G M,t G [0,1]) a point of M' and g** = g~[-i^ = \\Di\\'^ the extra-metric 
component. The Christoffel symbols involving t are 



= r;,, = -g'd^gt, rr = --g^^d.gtu = F^:, = F*, = F^,, = 0. 



The geodesic equation + F^^i; i;^ writes 

x' + g'\^^gtt)x'x^' = (4.10a) 



A View on Optimal Transport from Noncommutative Geometry 



21 



- Ig^'id.gttWf + T'^^pX^xP = (4.10b) 
and, because gu is a constant, they simplify to 

i = const = K, (4.11a) 

+ r^^x^i;'' = 0. (4.11b) 

The first geodesic equation indicates that the proper length r of a geodesic G' = x(t) in A4' 
between xq and yi is proportional to t, 

dt = Kdr, 

as well as to the line element ds = y/g^i,dxf^dx'^ of Ai since 

II II ^ h dx^ dx^ ^ ds*^ ^ / . \ 

1 = \\x\\ = Gabx^x^ = g^,u — — + guK^ = j^+guK^, (4.12) 

so that, assuming guK'^ ^ 1 (which from (4.12) amounts to take x ^ y), 
ds 



dT 



x/1 - 9ttK^ 



The second geodesic equation (4.11b) shows that the projection on 7W of G' is a geodesic G 
of A4. Therefore 

d'ixo,yi)= [ dT= [ ^ds = — [ ds = —=^==d{x,y), 
Jg' Jg V 1 - 9ttK^ Jg VI - gttK^ 

that is to say 

d''^{xQ,yi) = d^{x,y) + guK'^d''^{xQ,yi). 
Writing K'^d''^{xQ,yi) as 



L 



Kdr] = ( [ dt] ={t{{y,l)-t{{x,0))f 



ig' 

one obtains (4.9), and the result. ■ 

Let us underline an interesting feature of this proposition, namely that a cost which is not 
a distance M. can be seen as a distance on x 

Proposition 4.2 can also be rewritten using a cost vanishing on the diagonal, namely 



ci{x,y) = ci{x,y) - ,, J" ..g = \ d{x,y)'^ + 



\Dj\\^ Y"v-'«/ ' ||^^||2 ll^^l 

Writing Wj the associated minimal work, one has 

dD{xo,yi) = Wi{x,y) + 



\Di 



12 • 



Quite remarkably, the cost (4.8) is similar to the cost (32) introduced in [6] in the framework of 
the relativistic heat equation. 

Proposition 4.2 relies on the fact that the jump-cost (4.4) is constant. From a physics point 
of view, this means that one does not take into account the Higgs field. In almost commutative 
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geometries, the latest is obtained by inner fluctuation of the metric [13] that substitute D with 
a covariant Dirac operator. From the metric point of view this amounts to replacing (see [35] 
for details) (4.5) by 

V \\Dj + H{xW)^ 

where 

Hix) = (|l + /ii(x)p + |/i2(x)p)m2 

where (/ii,/i2) is the (complex) Higgs doublet and mt is the mass of the quark top. Instead 
of (4.3) one has doixo, xi) = dDj+H(a;)('^C) Su) (the jump-cost is no longer constant). In analogy 
with (4.8), one could define the cost 

ci{x,y) = ^d{x,yy + dj^^^H{x){^c, Sr)"^ 

which allows to avoid the introduction of the extra-dimension. However the projection of 
a geodesic in Ai' is no longer a geodesic of M. ((4.10b) does not simplify to (4.11b)) so there is 
no way to express d'{xQ,yi) as a function of d{x, y), and doixo, yi) = d'{xo, yi) no longer equals 
Wi{xQ,yi). 

5 Conclusion 

The spectral distance between states on a complete Riemannian spin manifold coincides with 
the Wasserstein distance of order 1. In the noncommutative case the analogies between the 
spectral distance and various Wasserstein distances, although still not fully understood, shed an 
interesting light on the interpretation of the distance formula in noncommutative geometry. In 
physics, defining the distance as a supremum rather than an infimum is useful since, at small 
scale, quantum mechanics indicates that notions as "paths between points" no longer make 
sense, so that the classical definition of distance as the length of the shortest path loses any 
operational meaning. An interesting feature of noncommutative geometry is to provide a notion 
of distance overcoming this difficulty. In transport theory the interpretation of Kantorovich 
formula has an interpretation in economics rather than in quantum mechanics: while Monge 
formulation corresponds to the minimization of a cost, Kantorovich dual formula corresponds to 
the maximization of a profit. To repeat a classical example found in the literature: assume that 
the distribution of flour-producers on a given territory M is given by //i and the distribution 
of bakeries by fi2- Consider a transport-consortium whose job consists in buying the flour at 
factories and selling it to bakers. The consortium fixes the value f{x) of the flour at the point x 
(it buys the flour at the price f{x) if there is a factory, or it sells it at a price f{x) if there is 
a bakery). The Wasserstein distance is the maximum profit the consortium may hope, under 
the constraint of staying competitive, that means not selling the flour to a price higher than the 
bakeries would pay if they were doing the transport by themselves (i.e. f{x) < f{y) + c(x, y) 
for all X, y). This raises an interesting question: in a quantum context, what is the physical 
meaning of this "profit" that one is maximizing while computing the distance? If one view 
the states on A4. as wave functions, is the distance related to the minimum work required to 
transform one wave into the second? More specifically, for M = M"* what physical quantity 
represents (3.4), and what is the meaning of the point a? 
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